Automatic Semantic Role Labeling using Selectional Preferences with Very Large Corpora

Author

  • Hiram Calvo
Abstract

Abstract of PhD thesis: Automatic Semantic Role Labeling using Selectional Preferences with Very Large Corpora (Determinación Automática de Roles Semánticos usando Preferencias de Selección sobre Corpus muy Grandes)

Graduated: Hiram Calvo, Center for Research in Computing (CIC), National Polytechnic Institute (IPN), Mexico City, Mexico, 07738
Graduated on June 19th, 2006
Advisor: Dr. Alexander Gelbukh, Computing Research Center (CIC), National Polytechnic Institute (IPN), Mexico City, Mexico, 07738, www.gelbukh.com

We present a method for recognizing semantic roles in Spanish sentences. The method is based on dependency parsing: heuristic rules infer dependency relationships between words, and word co-occurrence statistics (learnt in an unsupervised manner) resolve ambiguities such as prepositional phrase attachment. If a complete parse cannot be produced, a partial structure is built with some (if not all) dependency relations identified. Evaluation shows that, in spite of its simplicity, the parser's accuracy is higher than that of the existing parsers available for Spanish. Though certain grammar rules, as well as the lexical resources used, are specific to Spanish, the suggested approach is language-independent.

One ambiguity that we analyze in more depth is prepositional phrase attachment. The system applies an ordered set of simple heuristic rules iteratively to determine the relationships between words to which a governor has not yet been assigned. To resolve certain cases of ambiguity we use co-occurrence statistics of words collected beforehand in an unsupervised manner, either from large corpora or from the Web (through a search engine such as Google). These statistics are collected using Selectional Preferences.

To evaluate our system, we developed a method for converting a gold standard from a constituent format to a dependency format. Additionally, each module of the system (Selectional Preferences Acquisition and Prepositional Phrase Attachment Disambiguation) is evaluated separately and independently to verify that it works properly. Finally, we present some applications of our system: Word Sense Disambiguation and Linguistic Steganography.
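The abstract does not give the scoring formula used for prepositional phrase attachment, so the following is only a minimal sketch of the general idea of choosing between a verb and a noun attachment site by comparing co-occurrence statistics. The function name `attach_pp`, the relative-frequency score, the tie-breaking rule, and the toy counts are all assumptions made for illustration; they are not taken from the thesis.

```python
from collections import Counter

# Toy co-occurrence counts of (head, preposition, PP noun) triples.
# These numbers are invented for illustration only, not corpus data.
COUNTS = Counter({
    ("comer", "con", "tenedor"): 12,
    ("pasta", "con", "tenedor"): 1,
    ("comer", "con", "queso"): 2,
    ("pasta", "con", "queso"): 9,
})

# Toy total occurrence counts of each candidate head (also invented).
TOTALS = {"comer": 200, "pasta": 50}


def attach_pp(verb, noun, prep, pp_noun, counts, totals):
    """Pick the attachment site ('verb' or 'noun') for a prepositional phrase.

    Assumed scoring: relative frequency of the (head, prep, pp_noun) triple
    given the head. Ties go to the verb, which is an arbitrary choice here.
    """
    def score(head):
        total = totals.get(head, 0)
        if total == 0:
            return 0.0  # unseen head: no evidence for this attachment
        return counts.get((head, prep, pp_noun), 0) / total

    return "verb" if score(verb) >= score(noun) else "noun"


# "comer pasta con tenedor" (eat pasta with a fork)
print(attach_pp("comer", "pasta", "con", "tenedor", COUNTS, TOTALS))
# "comer pasta con queso" (eat pasta with cheese)
print(attach_pp("comer", "pasta", "con", "queso", COUNTS, TOTALS))
```

With these toy counts the first phrase attaches to the verb and the second to the noun. In a real system the counts would come from a large corpus or from search-engine hit counts, as the abstract describes.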


Similar resources


A Simple, Similarity-based Model for Selectional Preferences

We propose a new, simple model for the automatic induction of selectional preferences, using corpus-based semantic similarity metrics. Focusing on the task of semantic role labeling, we compute selectional preferences for semantic roles. In evaluations the similarity-based model shows lower error rates than both Resnik’s WordNet-based model and the EM-based clustering model, but has coverage pr...


Enriching a lexical semantic net with selectional preferences by means of statistical corpus analysis

Broad-coverage ontologies which represent lexical semantic knowledge are being built for more and more natural languages. Such resources provide very useful information for word sense disambiguation, which is crucial for a variety of NLP tasks (e.g. semantic annotation of corpora, information retrieval, or semantic inferencing). Since the manual encoding of such ontologies is very labour-intens...


Can Selectional Preferences Help Automatic Semantic Role Labeling?

We describe a topic model based approach for selectional preference. Using the topic features generated by an LDA model on the extracted predicate-arguments over the Chinese Gigaword corpus, we show improvement to our state-of-the-art Chinese SRL system by 2.34 F1 points on arguments of nominal predicates, 0.40 F1 point on arguments of verb predicates, and 0.66 F1 point overall. Moreover, simi...


Generalizing over Lexical Features: Selectional Preferences for Semantic Role Classification

This paper explores methods to alleviate the effect of lexical sparseness in the classification of verbal arguments. We show how automatically generated selectional preferences are able to generalize and perform better than lexical features in a large dataset for semantic role classification. The best results are obtained with a novel second-order distributional similarity measure, and the posi...



Journal title:
  • Computación y Sistemas

Volume 12

Publication date 2008